Abstract

In this study, the authors examine exhaustive business data on Japanese firms, which cover 
nearly all companies in the mid- and large-scale ranges in terms of firm size, to reach several 
key findings on profits/sales distribution and business growth trends. First, detailed balance is 
observed not only in profits data but also in sales data. Furthermore, the growth-rate distribution 
of sales has wider tails than the linear growth-rate distribution of profits in log-log scale. On 
the one hand, in the mid-scale range of profits, the probability of positive growth decreases and 
the probability of negative growth increases symmetrically as the initial value increases. This 
is called Non-Gibrat's First Property. On the other hand, in the mid-scale range of sales, the 
probability of positive growth decreases as the initial value increases, while the probability of 
negative growth hardly changes. This is called Non-Gibrat's Second Property. Under detailed 
balance, Non-Gibrat's First and Second Properties are analytically derived from the linear and 
quadratic growth-rate distributions in log-log scale, respectively. In both cases, the log-normal 
distribution is inferred from Non-Gibrat's Properties and detailed balance. These analytic results 
are verified by empirical data. Consequently, this clarifies the notion that the difference in shapes 
between growth-rate distributions of sales and profits is closely related to the difference between 
the two Non-Gibrat's Properties in the mid-scale range. 
INTRODUCTION

Distributions with a power-law tail have been found in various fields of natural and social
science. Examples include avalanche sizes in a sandpile model [1], fluctuations in the
intervals of heartbeats [2], fish school sizes [3], citation numbers of scientific papers [4],
frequency of jams in Internet traffic [5], city sizes (see the recent review in Ref. [6]),
land prices [7]-[9], stock market price changes [10], and firm sizes [11]. Here,
variables (denoted by $x$) follow the probability density function (PDF):

$$P(x) \propto x^{-(\mu+1)} \quad \text{for } x > x_{\mathrm{th}} \tag{1}$$

over some size threshold $x_{\mathrm{th}}$. This is called Pareto's Law, which was first observed
in the field of personal income [12]. The index $\mu$ is called the Pareto index. Refer to
Newman [13] for a useful description of Pareto's Law.

In statistical physics, the study of distributions with a power-law tail (1) is significant
because the $k$-th moment $\langle x^k \rangle = \int dx\, P(x)\, x^k$ diverges in the case of
$\mu < k$. It is impossible to describe the system by using the variance
$\sigma^2 = \langle x^2 \rangle$ or the standard deviation $\sigma$ in the case of $\mu < 2$.
This feature comes from power-law behavior in the tail. Furthermore, it is worth noting that
a large portion of the overall data are included in the power-law tail. For example,
approximately 90% of total sales or profits in Japanese firms are included in the power-law
tail. In economics (especially in macroeconomics), one of the major issues is the state of the
entire economy. In this sense, it is important to clarify the nature of the power-law tail not
only in physics but also in economics.
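
The divergence of the moments can be made explicit with a short worked integral. The sketch
below restricts the integral to the tail $x > x_{\mathrm{th}}$ and writes the tail PDF as
$C\,x^{-(\mu+1)}$, where the normalization constant $C$ is a symbol introduced here only for
illustration.

```latex
\begin{align}
\langle x^k \rangle_{\mathrm{tail}}
  &= \int_{x_{\mathrm{th}}}^{\infty} dx\; C\, x^{-(\mu+1)}\, x^{k}
   = C \left[ \frac{x^{\,k-\mu}}{k-\mu} \right]_{x_{\mathrm{th}}}^{\infty}
   \qquad (k \neq \mu) , \\
\langle x^k \rangle_{\mathrm{tail}}
  &= \frac{C\, x_{\mathrm{th}}^{\,k-\mu}}{\mu-k} \quad \text{for } k < \mu ,
  \qquad
  \langle x^k \rangle_{\mathrm{tail}} \to \infty \quad \text{for } \mu < k .
\end{align}
```

With $\mu \simeq 1$, as estimated later for both profits and sales, the second moment already
diverges, which is why the variance is not a usable descriptor of the tail.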

In general, the power-law breaks below the size threshold $x_{\mathrm{th}}$ to suppress the
divergence of the PDF [14], [15]. There are many distributions that have a power-law tail.
These include, for instance, Classical Pareto Distribution (Pareto Type I Distribution),
Pareto Type II Distribution, Inverse Gamma Distribution, Inverse Weibull Distribution,
q-Distribution, A-Distribution and B-Distribution [16]. In addition to these distributions,
it has been hypothesized that many other distributions with a power-law tail follow the
log-normal distribution for mid-sized variables below the size threshold $x_{\mathrm{th}}$:

$$P(x) \propto \frac{1}{x} \exp\left[ -\frac{\ln^2 (x/\bar{x})}{2\sigma^2} \right]
\quad \text{for } x_{\min} < x < x_{\mathrm{th}} . \tag{2}$$

Here, $\bar{x}$ is a mean value and $\sigma^2$ is a variance. A lower bound of the mid-scale
range $x_{\min}$ is often related to the lower bound of an exhaustive set of data. A pseudo
log-normal distribution is approximately derived from A-Distribution or B-Distribution in the
mid-sized range [16].
The study of distributions in the mid-scale range below the size threshold $x_{\mathrm{th}}$
is as important as the study of the power-law tail. In physics, we are interested not only in
the mechanism generating a power-law tail but also in the reason for the tail breaking. In
economics, we should note that the majority of firms are mid-sized. For instance, in sales or
profits data, more than 90% of the total number of firms are in the mid-scale range. In
this study, by examining exhaustive business data of Japanese firms that nearly cover the
mid- and large-scale ranges, the authors investigate the relevant distributions with a
power-law tail. This research is expected to be useful for understanding phenomena not only in
economics but also in physics.

On the one hand, it has been shown that Pareto's Law and the log-normal distribution can
be derived by assuming some model. For example, a multiplicative process with boundary
constraints and additive noise can generate Pareto's Law [17]. On the other hand, without
using any model, Fujiwara et al. have recently shown that Pareto's Law (1) is derived from
Gibrat's Law and from the detailed balance observed in the large-scale range of exhaustive
business data [18]. The relations among laws observed in exhaustive business data are
important for examining the characteristics of distributions based on firm size. For instance,
in the study of Fujiwara et al., it was found that the Pareto index $\mu$ is related to the
difference between a positive growth-rate distribution and a negative one. Furthermore, along
the lines of their study, one of the authors (A. I.) has shown that the log-normal
distribution (2) can be inferred from detailed balance and from Non-Gibrat's Property observed
in the profits data of the mid-scale range [19]. The study of the growth-rate distribution is
an interesting subject in itself, and an ongoing investigation into this issue has progressed
recently [20].

Detailed balance means that the system is thermodynamically in equilibrium, the state
of which is described as

$$P_J(x_T, x_{T+1}) = P_J(x_{T+1}, x_T) . \tag{3}$$

Here, $x_T$ and $x_{T+1}$ are firm sizes at two successive points in time. In Eq. (3), the
joint PDF $P_J(x_T, x_{T+1})$ is symmetric under the time-reversal exchange
$x_T \leftrightarrow x_{T+1}$.

Gibrat's Law and Non-Gibrat's Property are observed in the distributions of the firm-size
growth rate $R = x_{T+1}/x_T$. The conditional PDF of the growth rate $Q(R|x_T)$ is defined
as $Q(R|x_T) = P_J(x_T, R)/P(x_T)$ by using the PDF $P(x_T)$ and the joint PDF $P_J(x_T, R)$.
Gibrat's Law, which is observed in the large-scale range, implies that the conditional PDF
$Q(R|x_T)$ is independent of the initial value $x_T$ [21]:

$$Q(R|x_T) = Q(R) . \tag{4}$$

Sutton [22] provides an instructive resource for obtaining the proper perspective on Gibrat's
Law.

Non-Gibrat's Property reflects the dependence of the growth-rate distribution on the
initial value $x_T$. The following properties are observed in the mid-scale range of positive
profits data of Japanese firms [19]:

$$Q(R|x_T) = d(x_T)\, R^{-t_+(x_T)-1} \quad \text{for } R > 1 , \tag{5}$$
$$Q(R|x_T) = d(x_T)\, R^{+t_-(x_T)-1} \quad \text{for } R < 1 , \tag{6}$$
$$t_\pm(x_T) = \pm\alpha \ln x_T + C_\pm . \tag{7}$$

Here, $\alpha$ and $C_\pm$ are positive constants. In this composite Non-Gibrat's Property
(5)-(7), the probability of positive growth decreases and the probability of negative growth
increases symmetrically as the initial value $x_T$ increases in the mid-scale range. It is
particularly noteworthy that the shape of the growth-rate distribution (5)-(6) uniquely
determines the change in the growth-rate distribution (7) under detailed balance (3).
Moreover, the rate-of-change parameter $\alpha$ appears in the log-normal distribution (2).
We designate (5)-(7) as Non-Gibrat's First Property to distinguish it from another
Non-Gibrat's Property that is observed in sales data.

The shape of the growth-rate distribution (5)-(6) is linear in log-log scale. This type
of growth-rate distribution is observed in profits and income data of firms (for instance
[23], [24], [25]). In contrast, it has been reported in various articles that the growth-rate
distributions of assets, sales, number of employees in firms, and personal income have wider
tails than those of profits and income in log-log scale (for instance [26], [18], [27], [28],
[29], [25]). In this case, the shape of the growth-rate distribution is different from
Eqs. (5) and (6). There must be, therefore, another Non-Gibrat's Property corresponding to
this shape. In fact, it has been reported in several studies that a Non-Gibrat's Property
different from Non-Gibrat's First Property exists in the mid-scale range of assets and sales
of firms (for instance [30]-[32]).



In this study, we report the following findings by employing the sales data of Japanese 
firms, which include not only data in the large-scale range but also those in the mid-scale 
range. 

1. Detailed balance (3) is confirmed in the mid- and large-scale ranges of sales data.

2. In not only the large-scale range but also the mid-scale range of sales data, the
growth-rate distributions have wider tails than those of profits in log-log scale.

3. Under detailed balance (3), the allowed change of the growth-rate distribution in the
mid-scale range is analytically determined by using empirical data. The change is
different from that of profits. We call this Non-Gibrat's Second Property.

4. A log-normal distribution is derived from Non-Gibrat's Second Property and from 
detailed balance. This is verified with empirical data. 

From these results, we conclude that the shape of the growth-rate distribution determines 
the type of Non-Gibrat's Property in the mid-scale range. 



NON-GIBRAT'S FIRST PROPERTY 

In this section, we review the analytic discussion in Ref. [19] and confirm it by applying
the results to newly obtained data. In the analytic discussion, detailed balance (3) and the
shape of the growth-rate distribution (5)-(6) lead uniquely to a change in the growth-rate
distribution (7). In addition, a log-normal distribution (2) in the mid-scale range is derived
from Non-Gibrat's First Property and detailed balance.

FIG. 1. Scatter plot of positive profits in the database. Here, $x_T$ and $x_{T+1}$ are
positive profits of individual firms in consecutive years.

In this study, we employ profits and sales data supplied by the Research Institute of
Economy, Trade and Industry, IAA (RIETI) [33]. In this section we analyze profits data,
and sales data are analyzed in the next section. The data set, which was created by TOKYO
SHOKO RESEARCH, LTD. [34] in 2005, includes approximately 800,000 Japanese firms
over a period of three years: the current year, the preceding year, and the year before that.
The number of firms is approximately the same as the actual number of active Japanese
firms. This database is considered nearly comprehensive, at least in the mid- and large-scale
ranges. In this study, we investigate the joint PDF $P_J(x_T, x_{T+1})$ and the distribution of
the growth rate $R = x_{T+1}/x_T$. Therefore, by using data of each firm in the previous three
years, we analyze a data set that has two values at two successive points in time as follows:
$(x_T, x_{T+1})$ = (data in preceding year, data in current year) $\cup$ (data in year before
last, data in preceding year). Here, $\cup$ indicates set-theoretic union. This superposition
of data is employed in order to secure a statistically sufficient sample size. This procedure
is allowed in cases where the economy is stable, that is, thermodynamically in equilibrium.
The validity is checked by detailed balance, as described below.
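
The superposition of the two consecutive-year windows can be sketched in a few lines of
Python. This is only an illustration under stated assumptions: the record layout (a tuple of
three yearly values per firm, with `None` for a missing entry), the function names
`build_pairs` and `growth_rates`, and the demo values are all hypothetical and are not taken
from the database itself.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical record layout: firm id -> (year before last, preceding year, current year).
# None marks a missing entry; the real database layout is not described in the text.
FirmRecord = Tuple[Optional[float], Optional[float], Optional[float]]


def build_pairs(records: Dict[str, FirmRecord],
                positive_only: bool = True) -> List[Tuple[float, float]]:
    """Return the union of (x_T, x_{T+1}) pairs from the two consecutive-year windows."""
    pairs: List[Tuple[float, float]] = []
    for two_ago, prev, curr in records.values():
        # Window 1: (year before last, preceding year); window 2: (preceding, current).
        for x_t, x_t1 in ((two_ago, prev), (prev, curr)):
            if x_t is None or x_t1 is None:
                continue  # a pair needs both years
            if positive_only and (x_t <= 0 or x_t1 <= 0):
                continue  # keep only pairs with two positive values
            pairs.append((x_t, x_t1))
    return pairs


def growth_rates(pairs: List[Tuple[float, float]]) -> List[float]:
    """Growth rate R = x_{T+1} / x_T for every pair."""
    return [x_t1 / x_t for x_t, x_t1 in pairs]


# Made-up demo values, in arbitrary units.
demo = {
    "firm_a": (100.0, 120.0, 90.0),
    "firm_b": (None, 2000.0, 2500.0),
    "firm_c": (50.0, -10.0, 40.0),
}
pairs = build_pairs(demo)
```

In this sketch a firm with three valid years contributes two pairs, so the pooled sample can
be up to twice the number of firms; pooling is meaningful only if the two windows are
statistically alike, which is what the detailed-balance check is meant to support.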

First, detailed balance (3) is observed in profits data. Note that only positive-profits data
are analyzed here, since we assume that non-negligible negative profits are not listed in the
database. Negative-profits data are thus not regarded as exhaustive. We employ 622,420
data sets $(x_T, x_{T+1})$ that have two positive profits at two successive points in time.
Figure 1 shows the joint PDF $P_J(x_T, x_{T+1})$ as a scatter plot of individual firms.
Detailed balance (3) is confirmed by the Kolmogorov-Smirnov (KS), Wilcoxon-Mann-Whitney (WMW),
and Brunner-Munzel (BM) tests. In the statistical tests, the range of $x_T$ is divided into
$N$ bins as $i_0 < i_1 < \cdots < i_{n-1} < i_n < \cdots < i_N$ to approximately equalize the
number of data in each bin "$x_T \in [i_{n-1}, i_n)$ and $x_T > x_{T+1}$". Here, $i_0$ and
$i_N$ are the lower and the upper bounds of $x_T$, respectively. We compare the distribution
sample for "$P_J(x_T \in [i_{n-1}, i_n), x_{T+1})$ and $x_T > x_{T+1}$" with another sample for
"$P_J(x_T, x_{T+1} \in [i_{n-1}, i_n))$ and $x_T < x_{T+1}$" ($n = 1, 2, \cdots, N$) by making
the null hypothesis that these two samples are taken from the same parent distribution.
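
The bin-by-bin comparison can be sketched as follows. This is a simplified illustration, not
the procedure used for the figures: it implements only a two-sample KS test with the
asymptotic Kolmogorov series (the small-sample correction factor is a common textbook
approximation), it takes the bin edges as given instead of equalizing counts, and the
synthetic pairs are symmetric by construction. The names `ks_two_sample` and
`detailed_balance_pvalues` are hypothetical. Statistical libraries such as SciPy provide KS,
Mann-Whitney and Brunner-Munzel tests that would normally be preferred.

```python
import bisect
import math
import random


def ks_two_sample(a, b):
    """Two-sample Kolmogorov-Smirnov statistic D and an asymptotic p value."""
    a, b = sorted(a), sorted(b)
    n, m = len(a), len(b)
    d = 0.0
    for v in a + b:
        # Empirical CDFs of both samples evaluated at every observed value.
        fa = bisect.bisect_right(a, v) / n
        fb = bisect.bisect_right(b, v) / m
        d = max(d, abs(fa - fb))
    en = math.sqrt(n * m / (n + m))
    lam = (en + 0.12 + 0.11 / en) * d  # textbook small-sample correction (assumption)
    if lam < 0.3:
        return d, 1.0  # the series below converges poorly here; p is ~1 anyway
    # Kolmogorov series: p = 2 * sum_{k>=1} (-1)^(k-1) * exp(-2 k^2 lam^2).
    p = 2.0 * sum((-1) ** (k - 1) * math.exp(-2.0 * k * k * lam * lam)
                  for k in range(1, 101))
    return d, min(1.0, max(0.0, p))


def detailed_balance_pvalues(pairs, edges):
    """One p value per bin [edges[n-1], edges[n]) comparing the two mirrored samples."""
    pvals = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        # Sample A: x_{T+1} of firms that shrank, with the initial x_T inside the bin.
        down = [x1 for x0, x1 in pairs if lo <= x0 < hi and x0 > x1]
        # Sample B: x_T of firms that grew, with the final x_{T+1} inside the bin.
        up = [x0 for x0, x1 in pairs if lo <= x1 < hi and x0 < x1]
        pvals.append(ks_two_sample(down, up)[1] if down and up else None)
    return pvals


# Synthetic pairs: a random swap makes the joint distribution symmetric by construction.
random.seed(0)
pairs = []
for _ in range(4000):
    x0 = 10 ** random.uniform(3, 5)
    x1 = x0 * 10 ** random.gauss(0.0, 0.2)
    pairs.append((x0, x1) if random.random() < 0.5 else (x1, x0))

edges = [10 ** (3 + 0.5 * k) for k in range(5)]  # 10^3, 10^3.5, ..., 10^5
pvals = detailed_balance_pvalues(pairs, edges)
```

Under detailed balance the two samples in each bin are mirror images of each other, so the
fraction of bins with p values above 0.05 is the quantity to inspect.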
FIG. 2. Each p value of the WMW test for the scatter plot of positive-profits data points in
Fig. 1.

Each p value of the WMW test for the case of $N = 2000$ is shown in Fig. 2. Note that
the profits data contain a large number of same-value amounts, which are round numbers:
100, 200, ..., 1000, 2000, ..., 10000, 20000, .... This phenomenon is frequently observed
in economic data. A bin with a round-number amount may contain an exceptionally large
number of data in this method of division. For the case of $N = 2000$, almost all bins
typically contain 200 data; however, a bin with the round number of 5000, for instance,
contains an exceptional 4437 data. In order to generally equalize the average amount of
data in bins to the typical value, an appropriate number of empty bins are inserted at such
bins of round-number amounts as needed (Fig. 3). In the case of $N = 2000$, there are 759
empty bins. The p values with respect to the remaining 1241 bins are depicted in Fig. 2, in
which 1141 p values exceed 0.05. Regardless of the division number $N$ and the kind of test,
p values exceed 0.05 in approximately 92% of bins. This means that the null hypothesis is
not rejected at the 5% significance level in approximately 92% of the range. This result
does not change in the case where the range of $x_T$ is divided into logarithmically equal
bins. Consequently, the detailed balance (3) in Fig. 1 is generally confirmed.

FIG. 3. A bin with a round-number amount contains an exceptionally large number of data. In
order to generally equalize the average amount of data in bins to the typical value, empty
bins are inserted at bins with round-number amounts as needed.

FIG. 4. Conditional PDFs of positive-profits growth rate in the low-scale range
($10^1 < x_T < 10^3$). Here, $x_T$ and $x_{T+1}$ are positive profits in consecutive years, in
thousand yen.

Second, we divide the range of the initial value $x_T$ into logarithmically equal bins as
$x_T \in [10^{1+0.4(n-1)}, 10^{1+0.4n})$ ($n = 1, 2, \cdots, 15$) in order to identify the
shape of the growth-rate distribution and the change as the initial value $x_T$ increases.
The conditional PDFs $q(r|x_T)$ of the logarithmic growth rate $r = \log_{10} R$ are shown in
Figs. 4-6. In Figs. 5 and 6, the growth-rate distributions in the mid- and large-scale ranges
are approximated by a linear function of $r$:

$$\log_{10} q(r|x_T) = c(x_T) - t_+(x_T)\, r \quad \text{for } r > 0 , \tag{8}$$
$$\log_{10} q(r|x_T) = c(x_T) + t_-(x_T)\, r \quad \text{for } r < 0 . \tag{9}$$

The approximation (8)-(9) is equivalent to Eqs. (5) and (6) by using the relations
$\log_{10} q(r|x_T) = \log_{10} Q(R|x_T) + r + \log_{10}(\ln 10)$ and
$d(x_T) = 10^{c(x_T)}/\ln 10$. From $\int_0^\infty dR\, Q(R|x_T) = 1$, the normalization
coefficient $d(x_T)$ (or the intercept $c(x_T)$) is determined as

$$\frac{1}{d(x)} = \frac{1}{t_+(x)} + \frac{1}{t_-(x)} . \tag{10}$$
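
For completeness, the normalization integral behind Eq. (10) can be written out by splitting
the range of $R$ at $R = 1$ and inserting the power-law shapes of Eqs. (5) and (6). Both
integrals converge only for positive exponents, which is assumed here.

```latex
\begin{align}
1 &= \int_0^\infty dR\; Q(R|x_T)
   = d(x_T) \left[ \int_0^1 dR\; R^{\,t_-(x_T)-1}
                 + \int_1^\infty dR\; R^{-t_+(x_T)-1} \right] \\
  &= d(x_T) \left[ \frac{1}{t_-(x_T)} + \frac{1}{t_+(x_T)} \right],
  \qquad t_\pm(x_T) > 0 .
\end{align}
```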



FIG. 5. Conditional PDFs of positive-profits growth rate in the mid-scale range
($10^3 < x_T < 10^5$). Here, $x_T$ and $x_{T+1}$ are positive profits in consecutive years, in
thousand yen.

FIG. 6. Conditional PDFs of positive-profits growth rate in the large-scale range
($10^5 < x_T < 10^7$). Here, $x_T$ and $x_{T+1}$ are positive profits in consecutive years, in
thousand yen.

Following the discussion in a previous work [19], we derive the change in the growth-rate
distribution (7) from the shape of the growth-rate distribution (5)-(6) under detailed balance
(3) and then derive the log-normal distribution in the mid-scale range. Under the exchange
of variables from $(x_T, x_{T+1})$ to $(x_T, R)$, the two joint PDFs $P_J(x_T, x_{T+1})$ and
$P_J(x_T, R)$ are related to each other as $P_J(x_T, R) = x_T P_J(x_T, x_{T+1})$. Expressing
the joint PDF $P_J(x_T, R)$ through the conditional PDF $Q(R|x_T)$ and using detailed balance
(3), we obtain

$$\frac{P(x_T)}{P(x_{T+1})} = \frac{1}{R}\, \frac{Q(R^{-1}|x_{T+1})}{Q(R|x_T)} . \tag{11}$$

By substituting the shape of the growth-rate distribution (5)-(6) for the conditional PDF,
this alternative expression of detailed balance (11) is reduced to

$$\frac{\tilde{P}(x_T)}{\tilde{P}(x_{T+1})} = R^{\,+t_+(x_T) - t_-(x_{T+1}) + 1} \tag{12}$$

for the case of $R > 1$. Here, we denote $\tilde{P}(x) = d(x) P(x)$. By expanding Eq. (12)
around $R = 1$ with $x_T \to x$ and $x_{T+1} \to R x$, the following three differential
equations are obtained:

$$\left[ 1 + t_+(x) - t_-(x) \right] \tilde{P}(x) + x\, \tilde{P}'(x) = 0 , \tag{13}$$
$$t_+'(x) + t_-'(x) = 0 , \qquad t_+'(x) + x\, t_+''(x) = 0 . \tag{14}$$

The same differential equations are obtained for $R < 1$. Equations (14) uniquely fix
$t_\pm(x_T)$ as Eq. (7). Now, let us verify this by empirical data.
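
The step from Eqs. (14) to Eq. (7) can be made explicit by changing variables to
$y = \ln x$, so that $x\, d/dx = d/dy$. The integration constants are named $\alpha$ and
$C_\pm$ to match Eq. (7); the sign convention on $\alpha$ follows the empirical observation
that $t_+$ increases with $x$.

```latex
\begin{align}
t_+'(x) + x\, t_+''(x) = 0
  \;&\Longleftrightarrow\; \frac{d^2 t_+}{dy^2} = 0
  \;\Longrightarrow\; t_+(x) = \alpha \ln x + C_+ , \\
t_+'(x) + t_-'(x) = 0
  \;&\Longrightarrow\; t_-(x) = -\alpha \ln x + C_- .
\end{align}
```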

FIG. 7. Estimations of $c(x_T)$ and $t_\pm(x_T)$. Here, $x_T$ is the lower bound of each bin,
in thousand yen, and $c(x_T)$ is the original value of the growth-rate distribution. From
left, each point on the graph represents $n = 1, 2, \cdots, 15$.

FIG. 8. Estimations of $\sigma_\pm(x_T)$. Here, $x_T$ is the lower bound of each bin, in
thousand yen. From left, each point on the graph represents $n = 1, 2, \cdots, 15$.

Figure 7 shows $t_\pm(x_T)$ and $c(x_T)$ estimated by fitting the approximation (8)-(9) to
each growth-rate distribution in Figs. 4-6. In Fig. 7, $c(x_T)$ is fixed as the empirical
value and $t_\pm(x_T)$ is estimated by using the least-squares method. In Fig. 4, it is
difficult to approximate each growth-rate distribution by the linear function (8)-(9), and
the values for $n = 1, 2, \cdots, 5$ in Fig. 7 are untrustworthy. In Fig. 5, however, the
linear approximation (8)-(9) is appropriate. Applying the change in the growth-rate
distribution $t_\pm(x_T)$ (7) to $n = 6, 7, 8$ ($10^3 < x_T < 10^{4.2}$) in Fig. 7, we obtain
the rate-of-change parameter $\alpha = 0.11 \pm 0.02$ from $t_+(x_T)$ and
$\alpha = 0.11 \pm 0.03$ from $t_-(x_T)$ by using the least-squares method. This coincidence
of the two estimated values supports Non-Gibrat's First Property (5)-(7) in the empirical
data. We regard $10^3 < x_T < 10^{4.2}$ as the mid-scale range.
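
The estimate of $\alpha$ amounts to a straight-line fit of $t_+(x_T)$ against $\ln x_T$ over
the mid-scale bins. The sketch below shows such a fit with an ordinary least-squares routine.
The input values are synthetic: they are generated from Eq. (7) with $\alpha = 0.11$ and an
arbitrary $C_+ = 1.0$ purely for illustration, and the helper name `fit_line` is hypothetical.
The standard error returned here is the usual regression error of the slope, which may differ
from how the quoted error bars were obtained.

```python
import math


def fit_line(xs, ys):
    """Ordinary least-squares fit y = slope * x + intercept, with the slope's std. error."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = my - slope * mx
    # Residual sum of squares around the fitted line.
    resid = sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))
    stderr = math.sqrt(resid / (n - 2) / sxx) if n > 2 else float("nan")
    return slope, intercept, stderr


# Lower bounds of bins n = 6, 7, 8 (x_T = 10^3, 10^3.4, 10^3.8, in thousand yen).
lower_bounds = [10 ** 3.0, 10 ** 3.4, 10 ** 3.8]
# Synthetic t_+ values from Eq. (7); alpha = 0.11 and C_+ = 1.0 are illustrative only.
t_plus = [0.11 * math.log(x) + 1.0 for x in lower_bounds]

# Eq. (7) uses the natural logarithm of x_T as the regressor.
alpha, c_plus, err = fit_line([math.log(x) for x in lower_bounds], t_plus)
```

Fitting $t_-(x_T)$ in the same way should return a slope of $-\alpha$, which is the
consistency check described in the text.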

In Fig. 6, the growth-rate distribution barely changes as $n$ increases. This means that
Gibrat's Law (4) is valid in the large-scale range. In Fig. 7, the values $t_\pm(x_T)$ vary in
the large-scale range, since the number of data in Fig. 6 is statistically insufficient to
estimate $t_\pm(x_T)$ by the least-squares method. However, by measuring the positive and
negative standard deviations $\sigma_\pm$ of each growth-rate distribution in Figs. 4-6, we
confirmed that the growth-rate distribution only slightly changes in the range $x_T > 10^5$
(Fig. 8). From Fig. 8, we regard $x_T > 10^5$ as the large-scale range and set $\alpha = 0$
in this range. Strictly speaking, a constant parameter $\alpha$ must not take different
values. However, in the database, a large number of firms stay in the same range for two
successive years. This parameterization is, therefore, generally suitable for describing the
PDF.

In Fig. 7, $c(x_T) = \log_{10}(d(x_T) \ln 10)$ hardly changes in the mid- and large-scale
ranges $x_T > 10^3$. This is consistent with $C_\pm \gg \alpha \ln x_T$ in Eqs. (7) and (10).
Consequently, by approximation we determine that the dependence of $d(x_T)$ on $x_T$ is
negligible in the mid- and large-scale ranges. Using $t_\pm(x)$ (7), Eq. (13) uniquely decides
the PDF of $x$ as

$$P(x) = C\, x^{-(\mu+1)} \exp\left[ -\alpha \ln^2 x \right] \quad \text{for } x > x_{\min} . \tag{15}$$

Here, we regard $d(x)$ in $\tilde{P}(x) = d(x) P(x)$ as a constant and denote
$\mu = C_+ - C_-$. The solutions (7) and (15) satisfy Eq. (12) beyond perturbation around
$R = 1$, and thus these are not only necessary but also sufficient.

Figure 9 shows that the resultant PDF (15) fits the empirical profits data well. In the
large-scale range ($\alpha = 0$), the PDF (15) behaves as Pareto's Law (1). The Pareto
index is estimated as approximately $\mu \sim 1$ in the large-scale range ($x > 10^5$) of
Fig. 9. In the mid-scale range, the PDF (15) behaves as the log-normal distribution (2) with
$\alpha = 1/(2\sigma^2)$, $\mu = -\ln \bar{x}/\sigma^2$. Applying the PDF (15) to the
mid-scale range ($10^3 < x < 10^{4.2}$) of Fig. 9, we obtain the rate-of-change parameter
$\alpha = 0.082 \pm 0.089$ by using the least-squares method. The error bar is not small
because we have applied the least-squares method to the quadratic curve in log-log scale. The
estimated value ($\alpha = 0.082 \pm 0.089$) is, however, consistent with the values estimated
by the change in $t_\pm(x_T)$ ($\alpha = 0.11 \pm 0.02$ or $0.11 \pm 0.03$). From these
results, we conclude that Non-Gibrat's First Property is confirmed by the empirical data.

FIG. 9. A PDF of positive profits in the database. Pareto's Law is observed in the large-scale
range ($x > 10^5$) and the log-normal distribution in the mid-scale range
($10^3 < x < 10^{4.2}$).

NON-GIBRAT'S SECOND PROPERTY 

In this section, we investigate another Non-Gibrat's Property observed in the mid-scale
range of sales data. This is the main aim of this study. First, detailed balance (3) is also
observed in sales data. Here, we employ 1,505,108 data sets $(x_T, x_{T+1})$ that have two
sales at two successive points in time. Figure 10 shows the joint PDF $P_J(x_T, x_{T+1})$ as
a scatter plot of individual firms. Detailed balance (3) is also confirmed by using the KS,
WMW, and BM tests in the same manner as in the previous section. Figure 11 shows each p value
of the BM test for the $N = 5000$ case. Regardless of the division number $N$ and the kind of
test, p values exceed 0.05 in approximately 82% of bins. This means that the null hypothesis
is not rejected at the 5% significance level in approximately 82% of the range. Note that the
sales data also contain a large number of same-value amounts, which are round numbers.
The p values of the statistical test for bins with a large number of round values are
unusually small. In this situation, 82% is acceptable. The percentage is slightly higher in
the case where the range of $x_T$ is divided into logarithmically equal bins. We assume,
therefore, that detailed balance (3) in Fig. 10 is generally verified.

FIG. 10. Scatter plot of sales in the database. Here, $x_T$ and $x_{T+1}$ are sales of
individual firms in consecutive years.

FIG. 11. Each p value of the BM test for the scatter plot of sales data points in Fig. 10
(horizontal axis: sales $x$, in thousand yen).
FIG. 12. Conditional PDFs of sales growth rate in the small- and mid-scale ranges
($10^3 < x_T < 10^5$). Here, $x_T$ and $x_{T+1}$ are sales in consecutive years, in thousand
yen.

FIG. 13. Conditional PDFs of sales growth rate in the mid- and large-scale ranges
($10^5 < x_T < 10^7$). Here, $x_T$ and $x_{T+1}$ are sales in consecutive years, in thousand
yen.

FIG. 14. Conditional PDFs of sales growth rate in the large-scale range
($10^7 < x_T < 10^9$). Here, $x_T$ and $x_{T+1}$ are sales in consecutive years, in thousand
yen.

Second, we divide the range of the initial value $x_T$ into logarithmically equal bins as
$x_T \in [10^{3+0.4(n-1)}, 10^{3+0.4n})$ ($n = 1, 2, \cdots, 15$). The conditional growth-rate
distributions $q(r|x_T)$ are shown in Figs. 12-14. Each growth-rate distribution in
Figs. 12-14 has curvature. It is difficult to approximate the growth-rate distributions by the
linear approximation (8)-(9) as in the profits case. As the simplest extension, we have added
a second-order term with respect to $r$ to express the curvatures as follows:

$$\log_{10} q(r|x_T) = c(x_T) - t_+(x_T)\, r + \ln 10\; u_+(x_T)\, r^2 \quad \text{for } r_c > r > 0 , \tag{16}$$
$$\log_{10} q(r|x_T) = c(x_T) + t_-(x_T)\, r + \ln 10\; u_-(x_T)\, r^2 \quad \text{for } -r_c < r < 0 . \tag{17}$$

Note that we must introduce a cutoff $r_c$ in order to normalize the probability integration
as $\int_{10^{-r_c}}^{10^{r_c}} dR\, Q(R|x_T) = 1$, since Eqs. (16) and (17) are quadratic
with respect to $r$. From this normalization condition, $c(x_T)$ can be expressed by using
$t_\pm(x_T)$, $u_\pm(x_T)$, and $r_c$. The expression is quite complicated, and it is later
observed that $c(x_T)$ only slightly depends on $x_T$ in the empirical data. Therefore, we do
not describe the expression here.

The approximation (16)-(17) is rewritten as

$$Q(R|x_T) = d(x_T)\, R^{-1 - t_+(x_T) + u_+(x_T) \ln R} \quad \text{for } R > 1 , \tag{18}$$
$$Q(R|x_T) = d(x_T)\, R^{-1 + t_-(x_T) + u_-(x_T) \ln R} \quad \text{for } R < 1 . \tag{19}$$
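
The equivalence between the quadratic form in $r$ and the $\ln R$ term in the exponent of $R$
follows from the change of variables $R = 10^r$, in the same way as in the profits case. A
short worked version, using only the symbols already defined:

```latex
\begin{align}
R = 10^{r} \;&\Rightarrow\; \ln R = r \ln 10 , \qquad q(r|x_T)\, dr = Q(R|x_T)\, dR , \\
\log_{10} Q(R|x_T) &= \log_{10} q(r|x_T) - r - \log_{10}(\ln 10) , \\
\log_{10} R^{\,u_\pm(x_T) \ln R}
  &= \frac{u_\pm(x_T)\, \ln^2 R}{\ln 10}
   = \ln 10 \; u_\pm(x_T)\, r^2 .
\end{align}
```

The factor $\ln 10$ in front of $u_\pm$ in the quadratic fit is therefore exactly what is
needed for $u_\pm$ to appear without a prefactor in the exponent of $R$.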

By using this shape, in the case of $R > 1$, detailed balance (11) is reduced to

$$\frac{\tilde{P}(x_T)}{\tilde{P}(x_{T+1})}
  = R^{\,1 + t_+(x_T) - t_-(x_{T+1}) - \left[ u_+(x_T) - u_-(x_{T+1}) \right] \ln R} . \tag{20}$$

By expanding Eq. (20) around $R = 1$ with $x_T \to x$ and $x_{T+1} \to R x$, the following
five differential equations are obtained:

$$\left[ 1 + t_+(x) - t_-(x) \right] \tilde{P}(x) + x\, \tilde{P}'(x) = 0 , \tag{21}$$
$$x \left[ t_+'(x) + t_-'(x) \right] + 2 \left[ u_+(x) - u_-(x) \right] = 0 , \tag{22}$$
$$2\, t_+'(x) + t_-'(x) + 6\, u_+'(x) + x \left[ 2\, t_+''(x) + t_-''(x) \right] = 0 , \tag{23}$$
$$t_+'(x) + t_-'(x) + 3x \left[ t_+''(x) + t_-''(x) \right]
  + x^2 \left[ t_+^{(3)}(x) + t_-^{(3)}(x) \right] = 0 , \tag{24}$$
$$t_+'(x) + 7x\, t_+''(x) + 6x^2\, t_+^{(3)}(x) + x^3\, t_+^{(4)}(x) = 0 . \tag{25}$$

The same differential equations are obtained for $R < 1$. Equations (22)-(25) uniquely fix
the change in the growth-rate distribution $t_\pm(x)$, $u_\pm(x)$ as follows:

$$t_+(x) = \frac{\gamma}{3} \ln^3 x + \frac{\beta}{2} \ln^2 x + \alpha \ln x + C_1 , \tag{26}$$
$$t_-(x) = -\frac{\gamma}{3} \ln^3 x + \frac{\delta}{2} \ln^2 x + (\eta - \alpha) \ln x + C_2 , \tag{27}$$
$$u_+(x) = -\frac{\gamma}{6} \ln^2 x - \frac{2\beta + \delta}{6} \ln x + C_3 , \tag{28}$$
$$u_-(x) = -\frac{\gamma}{6} \ln^2 x + \frac{\beta + 2\delta}{6} \ln x + C_3 + \frac{\eta}{2} . \tag{29}$$

Now, let us confirm these solutions with the empirical data.

FIG. 15. Estimations of $c(x_T)$, $t_\pm(x_T)$, and $u_\pm(x_T)$. Here, $x_T$ is the lower
bound of each bin, in thousand yen, and $c(x_T)$ is the original value of the growth-rate
distribution. From left, each point on the graph represents $n = 1, 2, \cdots, 12$.

FIG. 16. Estimations of $\sigma_\pm(x_T)$. Here, $x_T$ is the lower bound of each bin, in
thousand yen. From left, each point on the graph represents $n = 1, 2, \cdots, 15$.

Figure 15 shows $t_\pm(x_T)$, $u_\pm(x_T)$ and $c(x_T)$ estimated by fitting the
approximation (16)-(17) to each growth-rate distribution in Figs. 12-14. In Fig. 15, $c(x_T)$
is fixed as the empirical value and $t_\pm(x_T)$ and $u_\pm(x_T)$ are estimated by using the
least-squares method. For $n = 13, 14, 15$ in Fig. 14, there are not sufficient data points to
estimate $t_\pm(x_T)$, $u_\pm(x_T)$ for $n = 14, 15$ or to estimate the error bar for
$n = 13$. Therefore, data points for $n = 13, 14, 15$ are not plotted in Fig. 15.

On the one hand, for $n = 9, 10, \cdots, 15$ ($x_T > 10^{6.2}$) in Figs. 13 and 14, the
growth-rate distribution hardly changes as $n$ increases. This means that Gibrat's Law (4) is
verified by the empirical data. We regard $x_T > 10^{6.2}$ as the large-scale range and set
$\gamma = \beta = \delta = \alpha = \eta = 0$ in this range because $t_\pm(x_T)$ and
$u_\pm(x_T)$ do not depend on $x_T$. In Fig. 15, the values of $t_\pm(x_T)$ and $u_\pm(x_T)$
vary in this range because the number of data in Fig. 13 is statistically insufficient to
estimate them by the least-squares method. However, by measuring positive and negative
standard deviations $\sigma_\pm$ of each growth-rate distribution in Figs. 12-14, we
confirmed that the growth-rate distribution hardly changes in the large-scale range
$x_T > 10^{6.2}$ (Fig. 16).



On the other hand, in Fig. 12, while the negative growth-rate distribution hardly changes
as $n$ increases, the positive growth-rate distribution gradually decreases. This is
Non-Gibrat's Property in the mid-scale range of sales data. We should estimate parameters
$\gamma$, $\beta$, $\delta$, $\alpha$ and $\eta$ by applying the change in the growth-rate
distribution (26)-(29) to Fig. 15. However, there are insufficient data points in Fig. 15 for
applying the least-squares method with the polynomial functions (26)-(29). Consequently, as a
first-order approximation, we assume that the negative growth-rate distribution does not
depend on $x_T$, even in the mid-scale range. This approximation is supported by Fig. 16
because the negative standard deviation $\sigma_-$ hardly changes compared with the positive
standard deviation $\sigma_+$.

In this approximation, the parameters are simplified as

$$\gamma = \delta = \beta = 0 \quad \text{and} \quad \eta = \alpha . \tag{30}$$

Only the change in the positive growth-rate distribution $t_+(x_T)$ depends on $x_T$ as
follows:

$$t_+(x) = \alpha \ln x + C_1 , \tag{31}$$
$$t_-(x) = C_2 , \quad u_+(x) = C_3 , \quad u_-(x) = C_3 + \frac{\alpha}{2} . \tag{32}$$

We call this Non-Gibrat's Second Property.
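
As a consistency check, the reduction can be traced term by term. The sketch below assumes
that the general solutions (26)-(29) take the polynomial form in $\ln x$ shown on the left of
each arrow, and applies the parameter choice (30) to each of them.

```latex
\begin{align}
t_+(x) &= \tfrac{\gamma}{3} \ln^3 x + \tfrac{\beta}{2} \ln^2 x + \alpha \ln x + C_1
  \;\longrightarrow\; \alpha \ln x + C_1 , \\
t_-(x) &= -\tfrac{\gamma}{3} \ln^3 x + \tfrac{\delta}{2} \ln^2 x + (\eta - \alpha) \ln x + C_2
  \;\longrightarrow\; C_2 , \\
u_+(x) &= -\tfrac{\gamma}{6} \ln^2 x - \tfrac{2\beta + \delta}{6} \ln x + C_3
  \;\longrightarrow\; C_3 , \\
u_-(x) &= -\tfrac{\gamma}{6} \ln^2 x + \tfrac{\beta + 2\delta}{6} \ln x + C_3 + \tfrac{\eta}{2}
  \;\longrightarrow\; C_3 + \tfrac{\alpha}{2} .
\end{align}
```

The choice $\eta = \alpha$ is what removes the $\ln x$ dependence from $t_-(x)$, so the
negative side of the growth-rate distribution becomes independent of the initial value while
the positive side keeps the slope $\alpha$.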

Applying $t_+(x_T)$ to $n = 3, 4, 5, 6$ ($10^{3.8} < x_T < 10^{5.4}$) in Fig. 15, we obtain
the rate-of-change parameter $\alpha = 0.68 \pm 0.03$ by the least-squares method. We regard
$10^{3.8} < x_T < 10^{5.4}$ as the mid-scale range of sales. In this range, $t_-(x_T)$ and
$u_\pm(x_T)$ hardly change compared with $t_+(x_T)$, so the approximation (32) is considered
relevant. Nevertheless, the value $\alpha$ estimated by the difference between $u_+(x_T)$ and
$u_-(x_T)$ disagrees with the value estimated by the change in $t_+(x_T)$. Most likely, this
comes from a limitation of the second-order approximation with respect to $r$ (16)-(17). To
fix this discrepancy, we may add a third-order term with respect to $r$. We will consider this
point in the conclusion. In addition, we should note that the intercept $c(x_T)$ only slightly
depends on $x_T$ in the mid- and large-scale ranges $x_T > 10^{3.8}$, as in the profits case.

Using t±(x) (12"B"I) - (|2"T1) . Eq. (|2"T|) uniquely determines the PDF of x as 



P(x) oc exp [— 1 In 4 x + — ^— In 3 x - (a - ■+) In 2 x\ . (33) 

Here, we regard d(x) in P(x) = d(x) P( constant and denote // = Ci — C%. The 

solutions f l26|) - fr29|) and fl33|) satisfy Eq. fTSOj) beyond perturbation around i? = 1, so these 
are not only necessary but also sufficient. In the approximation (130|) . the PDF is reduced to 

ex. 



P{x) oc x (At+1) exp 



■ In 2 x 
2 



(34) 



Figure 17 shows that the resulting PDF (34) fits the empirical sales data well. In
the large-scale range ($\alpha = 0$), the PDF (34) behaves as Pareto's Law (1). The Pareto index is
estimated as approximately $\mu \sim 1$ in the large-scale range ($x > 10^{6.2}$) of Fig. 17. In the
mid-scale range, the PDF (34) behaves as the log-normal distribution (5) in the same manner as in
the profits case. Applying the PDF (34) to the mid-scale range ($10^{3.8} < x < 10^{5.4}$) of Fig. 17,
we obtain the rate-of-change parameter $\alpha = 0.65 \pm 0.04$ by using the least-squares method.
This is consistent with the value estimated by the change in $t_+(x_T)$ ($\alpha = 0.68 \pm 0.03$).
From these results, we conclude that Non-Gibrat's Second Property is also confirmed by the
empirical data.

FIG. 17. A PDF of sales in the database. Pareto's Law is observed in the large-scale range
($x > 10^{6.2}$) and the log-normal distribution in the mid-scale range ($10^{3.8} < x < 10^{5.4}$).
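The least-squares estimate of $\alpha$ from the PDF (34) can be illustrated as follows. This is a minimal sketch, not the procedure actually used for Fig. 17: it assumes that the PDF has already been tabulated as pairs $(x, P(x))$ within one range, that natural logarithms are used as in Eq. (34), and that an unweighted quadratic fit in $\ln x$ is adequate. The names `solve_linear` and `fit_pdf_34` are hypothetical helpers introduced only for this example.

```python
import math


def solve_linear(a, b):
    """Solve a small dense system a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Work on an augmented copy so the caller's lists are left untouched.
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda i: abs(m[i][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for row in range(col + 1, n):
            factor = m[row][col] / m[col][col]
            for k in range(col, n + 1):
                m[row][k] -= factor * m[col][k]
    x = [0.0] * n
    for row in range(n - 1, -1, -1):
        tail = sum(m[row][k] * x[k] for k in range(row + 1, n))
        x[row] = (m[row][n] - tail) / m[row][row]
    return x


def fit_pdf_34(xs, ps):
    """Fit ln P = const - (mu + 1) ln x - (alpha / 2) ln^2 x and return (alpha, mu)."""
    ys = [math.log(x) for x in xs]
    zs = [math.log(p) for p in ps]
    y0 = sum(ys) / len(ys)  # centre ln x to keep the normal equations well conditioned
    ts = [y - y0 for y in ys]
    # Normal equations for z = a0 + a1 t + a2 t^2 with t = ln x - y0.
    moments = [sum(t ** k for t in ts) for k in range(5)]
    a = [[moments[i + j] for j in range(3)] for i in range(3)]
    b = [sum(z * t ** i for t, z in zip(ts, zs)) for i in range(3)]
    _, a1, a2 = solve_linear(a, b)
    # Undo the centring: the coefficient of ln x is a1 - 2 a2 y0.
    alpha = -2.0 * a2
    mu = -(a1 - 2.0 * a2 * y0) - 1.0
    return alpha, mu
```

Applied to points restricted to the mid-scale range, the returned `alpha` plays the role of the rate-of-change parameter. In the large-scale range one would expect `alpha` close to zero and `mu` close to the Pareto index, although the quality of such a fit depends on how the empirical PDF is binned.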

CONCLUSION 

In this study, we have employed exhaustive business data on Japanese firms that nearly 
cover not only the entire large-scale range but also the entire mid-scale range in terms of 
firm size. Using this newly assembled database, we first reconfirmed the previous analyses 

range, the log-normal distribution 
is derived from detailed balance and from Non-Gibrat's First Property. In Non-Gibrat's
First Property, the probability of positive growth decreases and the probability of negative
growth increases symmetrically as the initial value $x_T$ increases. Under detailed balance,
this change is uniquely derived from the shape of the growth-rate distribution, which is
linear in log-log scale.

Second, the following findings were reported with respect to sales data. Detailed balance is
also observed in the mid- and large-scale ranges of sales data. The growth-rate distribution of
sales has wider tails than the linear growth-rate distribution of profits in log-log scale. In the
mid-scale range, while the probability of negative growth hardly changes as the initial value
$x_T$ increases, the probability of positive growth gradually decreases. This feature is different
from Non-Gibrat's First Property observed in the profits data. We have approximated
the curved growth-rate distribution by a quadratic function. In addition, from
an empirical observation, we have imposed the condition that the negative growth-rate
distribution does not depend on $x_T$, even in the mid-scale range. Under detailed balance,
these approximations and conditions uniquely lead to a decrease in positive growth. We call
this Non-Gibrat's Second Property. In the mid-scale range, the log-normal distribution is
also derived from detailed balance and from Non-Gibrat's Second Property. These results
are confirmed by the empirical data.

In this study, it was clarified that the shape of the growth-rate distribution of sales is 
different from that of profits. It was also demonstrated that this difference is closely related 
to the difference between two kinds of Non-Gibrat's Properties in the mid-scale range. The 
growth-rate distribution of income of firms is approximated by a linear function in log- 
log scale as in the profits data. The growth-rate distributions of assets, the number of 
employees, and personal income have wider tails than a linear function in log-log scale, as in 
the sales data. If we obtained exhaustive data that include the mid-scale range, Non-Gibrat's 
First Property would probably be observed in the income data of firms, while Non-Gibrat's 
Second Property would probably be observed in the assets, the number of employees, and 
the personal income data. 

We have not determined what makes the difference between the shapes of the growth-rate
distributions. However, this difference is probably related to the following factors [25].
Income and profits of firms are, in a rough estimate, calculated by subtracting total
expenditures from total sales. Assets and sales of firms, the number of employees, and personal
income are not calculated by any subtraction.

Let us consider the distribution of added values, the sum of which is GDP. Clearly, 
added values are calculated by some subtraction. If we obtained exhaustive data of added 
values, Non-Gibrat's First Property would certainly be observed. It has been reported that 
the growth-rate distribution of GDPs of countries is linear in log-log scale (for instance,
[35]). This report reinforces that speculation. The results in this paper should be carefully
considered in cases where governments and firms discuss strategies of growth. 

Finally, we consider a method to fix the inconsistency in which the rate-of-change parameter
$\alpha$ is not correctly estimated by the difference between $u_\pm(x_T)$ in Eq. (32). Let us add not only the
second-order term with respect to $r$ but also a third-order term as follows:

$$\log_{10} q(r|x_T) = c(x_T) - t_+(x_T)\, r + \ln 10\; u_+(x_T)\, r^2 - \ln^2 10\; v_+(x_T)\, r^3 \quad \text{for } r > 0, \tag{35}$$
$$\log_{10} q(r|x_T) = c(x_T) + t_-(x_T)\, r + \ln 10\; u_-(x_T)\, r^2 + \ln^2 10\; v_-(x_T)\, r^3 \quad \text{for } r < 0. \tag{36}$$
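To make the piecewise form of Eqs. (35)-(36) concrete, the following sketch evaluates $\log_{10} q(r|x_T)$ for given coefficient values at one fixed $x_T$. The function name `log10_q_cubic` and its argument order are hypothetical choices for this illustration, and the coefficient values passed in are assumed to have been obtained elsewhere.

```python
import math

LN10 = math.log(10.0)


def log10_q_cubic(r, c, t_plus, t_minus, u_plus, u_minus, v_plus, v_minus):
    """Evaluate log10 q(r | x_T) in the cubic approximation, Eqs. (35)-(36).

    All coefficients are the values of c, t_pm, u_pm, v_pm at one fixed x_T.
    """
    if r >= 0.0:
        # Positive growth branch, Eq. (35).
        return c - t_plus * r + LN10 * u_plus * r ** 2 - LN10 ** 2 * v_plus * r ** 3
    # Negative growth branch, Eq. (36).
    return c + t_minus * r + LN10 * u_minus * r ** 2 + LN10 ** 2 * v_minus * r ** 3
```

Both branches return $c(x_T)$ at $r = 0$, so the approximation is continuous there. Writing $s = \ln R = r \ln 10$, the explicit factors $\ln 10$ and $\ln^2 10$ cancel, giving $\ln q = c \ln 10 - t_+ s + u_+ s^2 - v_+ s^3$ for $s > 0$, which is presumably why they appear in Eqs. (35)-(36).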

In the same manner as in the previous section, under detailed balance, coefficients $t_\pm(x)$,
$u_\pm(x)$, and $v_\pm(x)$ are uniquely obtained as follows:

t + (x) = | In 5 x + - In 4 x + | In 3 x + ^ In 2 x + a In x + C x , (37) 
= -^ln 5 x + ^— — In 4 x + ^— ^ In 3 x + ^— - In 2 x + (77 - a) lnx + C 2 , (38) 

4J: O j£ 

m+ x =-Mn 4 x — — ln 3 x-(A + ^— — In 2 x + (z/ -p-)lnx + C 3 , 39 

5 2U 12 o 

u_(x) = — - In 4 x — — In 3 x — (A H — — ) In 2 x + (1/ H — — ) In x 

5 20 12 6 

+^3 + \ , (40) 

t> + (x) = j- In 3 x + 2K J" 6 In 2 x + A In x + C 4 , (41) 

w _(x) = — -ln 3 x + — — -ln 2 x - (A )lnx + C 4 + /i . (42) 

15 20 6 

By imposing the condition that the negative growth-rate distribution does not depend
on $x_T$ even in the mid-scale range, these are simplified as follows:

$$t_+(x) = \frac{\delta}{2}\ln^2 x + \alpha \ln x + C_1, \qquad t_-(x) = C_2, \tag{43}$$
$$u_+(x) = -\frac{\delta}{2}\ln x + C_3, \qquad u_-(x) = C_3 + \frac{\alpha}{2}, \tag{44}$$
$$v_+(x) = C_4, \qquad v_-(x) = C_4 - \frac{\delta}{6}. \tag{45}$$

The results in the previous sections (31) and (32) correspond to a special case $\delta = 0$, $C_4 = 0$
in Eqs. (43)-(45). In the previous section, it was difficult to estimate $\alpha$ by the difference in
$u_\pm(x)$. In the expressions (43)-(45), this discrepancy is probably solved with a negative $\delta$.
Note that Eqs. (26)-(29) cannot be reduced to Eqs. (43) and (44) in any parameterization.
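The consistency of the simplified coefficients with detailed balance can be checked numerically. The sketch below rests on several assumptions that are not restated in this section: $Q(R|x) = q(r|x)/(R \ln 10)$ with $r = \log_{10} R$, detailed balance in the form $P(x)\,Q(R|x) = R^{-1} P(Rx)\,Q(R^{-1}|Rx)$, a constant $d(x)$, and a PDF $P(x) \propto x^{-(\mu+1)} \exp[-\frac{\alpha}{2}\ln^2 x - \frac{\delta}{6}\ln^3 x]$ obtained by integrating $t_+(x) - t_-(x)$. It also assumes coefficients $\delta/2$, $\alpha/2$, and $\delta/6$ in the $u_\pm$ and $v_\pm$ expressions. All function names are hypothetical.

```python
import math


def coefficients(y, alpha, delta, c1, c2, c3, c4):
    """Simplified coefficients at y = ln x (assumed form of Eqs. (43)-(45))."""
    t_plus = 0.5 * delta * y * y + alpha * y + c1
    t_minus = c2
    u_plus = -0.5 * delta * y + c3
    u_minus = c3 + 0.5 * alpha
    v_plus = c4
    v_minus = c4 - delta / 6.0
    return t_plus, t_minus, u_plus, u_minus, v_plus, v_minus


def ln_q_ratio(s, y, **par):
    """ln Q(R | x) up to the constant ln d, with s = ln R and y = ln x."""
    tp, tm, up, um, vp, vm = coefficients(y, **par)
    if s >= 0.0:
        return (-1.0 - tp) * s + up * s ** 2 - vp * s ** 3
    return (-1.0 + tm) * s + um * s ** 2 + vm * s ** 3


def ln_p(y, alpha, delta, c1, c2, **_unused):
    """ln P(x) up to a constant, assuming the PDF stated in the lead-in."""
    mu = c1 - c2
    return -(mu + 1.0) * y - 0.5 * alpha * y ** 2 - delta * y ** 3 / 6.0


def detailed_balance_gap(s, y, **par):
    """ln of [P(x) Q(R|x)] minus ln of [P(Rx) Q(1/R|Rx) / R]; zero if balance holds."""
    lhs = ln_p(y, **par) + ln_q_ratio(s, y, **par)
    rhs = -s + ln_p(y + s, **par) + ln_q_ratio(-s, y + s, **par)
    return lhs - rhs
```

For example, `detailed_balance_gap(0.4, 10.0, alpha=0.68, delta=-0.05, c1=1.2, c2=0.3, c3=0.1, c4=0.02)` should vanish up to rounding error for any choice of the constants. The parameter values here are arbitrary illustrations, not fitted values. Setting `delta=0.0` and `c4=0.0` recovers the quadratic case, in which $u_-(x) - u_+(x) = \alpha/2$.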

It is technically difficult to estimate $t_\pm(x)$, $u_\pm(x)$, and $v_\pm(x)$ by approximating the
growth-rate distribution by the cubic function (35)-(36) and to estimate $\delta$ and $\alpha$ by fitting
Eqs. (43)-(45) with the least-squares method. At the same time, under the approximation by the cubic
function (35)-(36), the integration $\int_0^\infty dR\, Q(R|x_T)$ converges without a cut $r_c$, as in the
linear approximation. Because this work involves difficulties as well as advantages, we will
investigate the above issues in the near future.
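The convergence statement can be made explicit in the variable $s = \ln R$. The sketch below assumes $Q(R|x_T) = q(r|x_T)/(R \ln 10)$ with $r = \log_{10} R$, writes $d = 10^{c(x_T)}/\ln 10$ for the prefactor, and assumes $v_\pm(x_T) > 0$. The sign condition is an assumption made here for illustration, not a result quoted from the fits.

```latex
\begin{align}
\int_0^\infty dR\, Q(R|x_T)
  &= d \int_0^\infty ds\, \exp\!\left[-t_+ s + u_+ s^2 - v_+ s^3\right]
   + d \int_0^\infty ds\, \exp\!\left[-t_- s + u_- s^2 - v_- s^3\right].
\end{align}
```

The first integral collects $R > 1$ and the second $R < 1$ after the substitution $s \to -s$. With $v_\pm > 0$ the cubic term dominates at large $s$, so both integrals converge whatever the sign of $u_\pm$. With $v_\pm = 0$ and $u_\pm > 0$ the integrands grow without bound, which is why the quadratic approximation needs the cut $r_c$.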



ACKNOWLEDGMENTS 



The authors thank the Research Institute of Economy, Trade and Industry, IAA (RIETI) 
for supplying the data set used in this work. This study was produced from the research
the authors conducted as members of the Program for Promoting Social Science Research
Aimed at Solutions of Near-Future Problems, "Design of Interfirm Networks to Achieve 
Sustainable Economic Growth." This work was supported in part by a Grant-in-Aid for 
Scientific Research (C) (No. 20510147) from the Ministry of Education, Culture, Sports, 
Science and Technology, Japan. Takayuki Mizuno was supported by funding from the Kampo 
Foundation 2009. 



* ishikawa@kanazawa-gu.ac.jp
† fujimoto@kanazawa-gu.ac.jp
‡ mizuno@ier.hit-u.ac.jp

[1] P. Bak, C. Tang and K. Wiesenfeld, Phys. Rev. Lett. 59 (1987) 381;
    P. Bak, C. Tang and K. Wiesenfeld, Phys. Rev. A 38 (1988) 364.
[2] C.-K. Peng, J. Mietus, J. M. Hausdorff, S. Havlin, H. E. Stanley and A. L. Goldberger, Phys. Rev. Lett. 70 (1993) 1343.
[3] E. Bonabeau and L. Dagorn, Phys. Rev. E 51 (1995) R5220.
[4] S. Redner, Eur. Phys. J. B 4 (1998) 131.
[5] M. Takayasu, H. Takayasu and T. Sato, Physica A 233 (1996) 824.
[6] A. Saichev, Y. Malevergne and D. Sornette, Theory of Zipf's law and beyond, Lecture Notes in Economics and Mathematical Systems, p. 632 (Springer, 2009).
[7] T. Kaizoji, Physica A 326 (2003) 256.
[8] T. Yamano, Eur. Phys. J. B 38 (2004) 665.
[9] A. Ishikawa, Physica A 371 (2006) 525;
    A. Ishikawa, Prog. Theor. Phys. Suppl. No. 179 (2009) 103.
[10] R. N. Mantegna and H. E. Stanley, Nature 376 (1995) 46.
[11] M. H. R. Stanley, S. V. Buldyrev, S. Havlin, R. Mantegna, M. A. Salinger and H. E. Stanley, Economics Lett. 49 (1995) 453.
[12] V. Pareto, Cours d'Économie Politique (Macmillan, London, 1897).
[13] M. E. J. Newman, Contemporary Physics 46 (2005) 323;
    A. Clauset, C. R. Shalizi and M. E. J. Newman, SIAM Review 51 (2009) 661.

[14] W. W. Badger, in B. J. West (ed.), Mathematical Models as a Tool for the Social Science, p. 87 (Gordon and Breach, New York, 1980).
[15] E. W. Montroll and M. F. Shlesinger, J. Stat. Phys. 32 (1983) 209.
[16] H. Aoyama, H. Iyetomi, Y. Ikeda, W. Souma and Y. Fujiwara, ECONOPHYSICS (Kyoritsu, Tokyo, 2008, in Japanese).
[17] M. Levy, S. Solomon, Int. J. Mod. Phys. C 7 (1996) 595;
    H. Kesten, Acta Math. 131 (1973) 207;
    D. Sornette, R. Cont, J. Phys. I 7 (1997) 431;
    H. Takayasu, A.-H. Sato, M. Takayasu, Phys. Rev. Lett. 79 (1997) 966.
[18] Y. Fujiwara, W. Souma, H. Aoyama, T. Kaizoji and M. Aoki, Physica A 321 (2003) 598;
    Y. Fujiwara, C. D. Guilmi, H. Aoyama, M. Gallegati and W. Souma, Physica A 335 (2004) 197.
[19] A. Ishikawa, Physica A 367 (2006) 425;
    A. Ishikawa, Physica A 383 (2007) 79.
[20] M. Riccaboni, F. Pammolli, S. V. Buldyrev, L. Ponta and H. E. Stanley, Proc. Natl. Acad. Sci. USA 105 (2008) 19595.
[21] R. Gibrat, Les inégalités économiques (Sirey, Paris, 1932).
[22] J. Sutton, J. Econ. Lit. 35 (1997) 40.
[23] K. Okuyama, M. Takayasu and H. Takayasu, Physica A 269 (1999) 125.
[24] A. Ishikawa, Physica A 363 (2006) 367.
[25] A. Ishikawa, Economics 3, Special Issue Reconstructing Macroeconomics, 2009-11.
[26] L. A. N. Amaral, S. V. Buldyrev, S. Havlin, H. Leschhorn, P. Maass, M. A. Salinger, H. E. Stanley and M. H. R. Stanley, J. Phys. I France 7 (1997) 621.
[27] K. Matia, D. Fu, S. V. Buldyrev, F. Pammolli, M. Riccaboni and H. E. Stanley, Europhys. Lett. 67 (2004) 498.
[28] D. Fu, F. Pammolli, S. V. Buldyrev, M. Riccaboni, K. Matia, K. Yamasaki and H. E. Stanley, Proc. Natl. Acad. Sci. 102 (2005) 18801.
[29] S. V. Buldyrev, J. Growiec, F. Pammolli, M. Riccaboni and H. E. Stanley, J. Eur. Economic Association 5 (2-3) (2007) 574.
[30] H. Aoyama, Ninth Annual Workshop on Economic Heterogeneous Interacting Agents (WEHIA 2004);
    H. Aoyama, Y. Fujiwara and W. Souma, The Physical Society of Japan 2004 Autumn Meeting.
[31] H. Aoyama, H. Iyetomi, Y. Ikeda, W. Souma and Y. Fujiwara, Pareto Firms (Nihon Keizai Hyouronsha, Tokyo, 2007, in Japanese).
[32] H. Takayasu, New way of financing firms based on the fat-tailed distribution of growth rate, APFA7 & Tokyo Tech. Hitotsubashi Interdisciplinary Conference (2009).
[33] Research Institute of Economy, Trade and Industry, IAA (RIETI), http://www.rieti.go.jp/en/index.html
[34] TOKYO SHOKO RESEARCH, LTD., http://www.tsr-net.co.jp/
[35] D. Canning, L. A. N. Amaral, Y. Lee, M. Meyer and H. E. Stanley, Economics Lett. 60 (1998) 335.



